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SEQUENCE LISTING 
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<130> D-0021.5 PCT 

<140> PCT/US99/24879 
<141> 1999-10-22 

<150> 09/177,776 
<151> 1998-10-23 

<150> 09/178,115 
<151> 1998-10-23 

<160> 143 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1522 
<212> DNA 
<213> HUMAN 

<220> 

<221> CDS 

<222> (13) . . (1389) 

<220> 

<221> mat_peptide 
<222> (124) . . (1389) 

<400> 1 

acagtcagcc gc atg get ccc ctg tgc ccc age ccc tgg etc cct ctg ttg 51 

Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu 

-35 -30 -25 

ate ccg gee cct get cca ggc etc act gtg caa ctg ctg ctg tea ctg 99 
lie Pro Ala Pro Ala Pro Gly Leu Thr Val Gin Leu Leu Leu Ser Leu 

-20 -15 -10 
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ctg ctt ctg atg cct gtc cat ccc cag agg ttg ccc egg atg cag gag 147 
Leu Leu Leu Met Pro Val His Pro Gin Arg Leu Pro Arg Met Gin Glu 
-5 -11 5 

gat tec ccc ttg gga gga ggc tct tct ggg gaa gat gac cca ctg ggc 195 

Asp Ser Pro Leu Gly Gly Gly Ser Ser Gly Glu Asp Asp Pro Leu Gly 
10 15 20 

gag gag gat ctg ccc agt gaa gag gat tea ccc aga gag gag gat cca 243 

Glu Glu Asp Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro 

25 30 35 40 

ccc gga gag gag gat eta cct gga gag gag gat eta cct gga gag gag 2 91 

Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu 

45 50 55 

gat eta cct gaa gtt aag cct aaa tea gaa gaa gag ggc tec ctg aag 33 9 

Asp Leu Pro Glu Val Lys Pro Lys Ser Glu Glu Glu Gly Ser Leu Lys 
60 65 70 

tta gag gat eta cct act gtt gag get cct gga gat cct caa gaa ccc 387 

Leu Glu Asp Leu Pro Thr Val Glu Ala Pro Gly Asp Pro Gin Glu Pro 
75 80 85 

cag aat aat gee cac agg gac aaa gaa ggg gat gac cag agt cat tgg 435 

Gin Asn Asn Ala His Arg Asp Lys Glu Gly Asp Asp Gin Ser His Trp 
90 95 100 

cgc tat gga ggc gac ccg ccc tgg ccc egg gtg tec cca gee tgc gcg 483 

Arg Tyr Gly Gly Asp Pro Pro Trp Pro Arg Val Ser Pro Ala Cys Ala 

105 110 115 120 

ggc cgc ttc cag tec ccg gtg gat ate cgc ccc cag etc gee gec ttc 531 

Gly Arg Phe Gin Ser Pro Val Asp lie Arg Pro Gin Leu Ala Ala Phe 

125 130 135 

tgc ccg gee ctg cgc ccc ctg gaa etc ctg ggc ttc cag etc ccg ccg 579 

Cys Pro Ala Leu Arg Pro Leu Glu Leu Leu Gly Phe Gin Leu Pro Pro 
140 145 150 

etc cca gaa ctg cgc ctg cgc aac aat ggc cac agt gtg caa ctg ace 627 

Leu Pro Glu Leu Arg Leu Arg Asn Asn Gly His Ser Val Gin Leu Thr 
155 160 165 
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ctg cct cct ggg eta gag atg get ctg ggt ccc ggg egg gag tac egg 675 
Leu Pro Pro Gly Leu Glu Met Ala Leu Gly Pro Gly Arg Glu Tyr Arg 
170 175 180 

get ctg cag ctg cat ctg cac tgg ggg get gca ggt cgt ccg ggc teg 723 
Ala Leu Gin Leu His Leu His Trp Gly Ala Ala Gly Arg Pro Gly Ser 
185 190 195 200 

gag cac act gtg gaa ggc cac cgt ttc cct gec gag ate cac gtg gtt 771 
Glu His Thr Val Glu Gly His Arg Phe Pro Ala Glu lie His Val Val 

205 210 215 

cac etc age ace gee ttt gee aga gtt gac gag gee ttg ggg cgc ccg 819 
His Leu Ser Thr Ala Phe Ala Arg Val Asp Glu Ala Leu Gly Arg Pro 
220 225 230 

gga ggc ctg gee gtg ttg gee gee ttt ctg gag gag ggc ccg gaa gaa 867 
Gly Gly Leu Ala Val Leu Ala Ala Phe Leu Glu Glu Gly Pro Glu Glu 
235 240 245 

aac agt gee tat gag cag ttg ctg tct cgc ttg gaa gaa ate get gag 915 
Asn Ser Ala Tyr Glu Gin Leu Leu Ser Arg Leu Glu Glu lie Ala Glu 
250 255 260 

gaa ggc tea gag act cag gtc cca gga ctg gac ata tct gca etc ctg 963 
Glu Gly Ser Glu Thr Gin Val Pro Gly Leu Asp lie Ser Ala Leu Leu 
265 270 275 280 

ccc tct gac ttc age cgc tac ttc caa tat gag ggg tct ctg act aca 1011 
Pro Ser Asp Phe Ser Arg Tyr Phe Gin Tyr Glu Gly Ser Leu Thr Thr 

285 290 295 

ccg ccc tgt gec cag ggt gtc ate tgg act gtg ttt aac cag aca gtg 1059 
Pro Pro Cys Ala Gin Gly Val He Trp Thr Val Phe Asn Gin Thr Val 
300 305 310 

atg ctg agt get aag cag etc cac ace etc tct gac ace ctg tgg gga 1107 
Met Leu Ser Ala Lys Gin Leu His Thr Leu Ser Asp Thr Leu Trp Gly 
315 320 325 

cct ggt gac tct egg eta cag ctg aac ttc cga gcg acg cag cct ttg 1155 
Pro Gly Asp Ser Arg Leu Gin Leu Asn Phe Arg Ala Thr Gin Pro Leu 
330 335 340 
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aat ggg cga 
Asn Gly Arg 
345 

cct egg get 
Pro Arg Ala 



gac ate eta 
Asp lie Leu 



gcg ttc ctt 
Ala Phe Leu 
395 

ggt gtg age 
Gly Val Ser 
410 



gtg att gag 
Val He Glu 
350 

get gag cca 
Ala Glu Pro 
365 

gee ctg gtt 
Ala Leu Val 
380 

gtg cag atg 
Val Gin Met 



tac cgc cca 
Tyr Arg Pro 



gee tec ttc 
Ala Ser Phe 



gtc cag ctg 
Val Gin Leu 



ttt ggc etc 
Phe Gly Leu 
385 

aga agg cag 
Arg Arg Gin 
400 

gca gag gta 
Ala Glu Val 
415 



cct get gga 
Pro Ala Gly 
355 

aat tec tgc 
Asn Ser Cys 
370 

ctt ttt get 
Leu Phe Ala 



cac aga agg 
His Arg Arg 



gee gag act 
Ala Glu Thr 
420 



gtg gac age 
Val Asp Ser 



ctg get get 
Leu Ala Ala 
375 

gtc ace age 
Val Thr Ser 
390 

gga acc aaa 
Gly Thr Lys 
405 

gga gee 
Gly Ala 



agt 12 03 

Ser 

360 

ggt 12 51 
Gly 



gtc 1299 
Val 



ggg 1347 
Gly 



1389 



tagaggctgg atcttggaga atgtgagaag ccagccagag gcatctgagg gggagccggt 144 9 
aactgtcctg tcctgctcat tatgecaett ccttttaact gecaagaaat tttttaaaat 1509 
aaatatttat aat 1522 



<210> 2 
<211> 459 
<212> PRT 
<213> HUMAN 

<400> 2 

Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu He Pro Ala 
-35 -30 -25 

Pro Ala Pro Gly Leu Thr Val Gin Leu Leu Leu Ser Leu Leu Leu Leu 
-20 -15 -10 

Met Pro Val His Pro Gin Arg Leu Pro Arg Met Gin Glu Asp Ser Pro 
-5 -11 5 10 

Leu Gly Gly Gly Ser Ser Gly Glu Asp Asp Pro Leu Gly Glu Glu Asp 
15 20 25 
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it 



Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu 
30 35 40 

Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro 
45 50 55 

Glu Val Lys Pro Lys Ser Glu Glu Glu Gly Ser Leu Lys Leu Glu Asp 
60 65 70 75 

Leu Pro Thr Val Glu Ala Pro Gly Asp Pro Gin Glu Pro Gin Asn Asn 

80 85 90 

Ala His Arg Asp Lys Glu Gly Asp Asp Gin Ser His Trp Arg Tyr Gly 
95 100 105 

Gly Asp Pro Pro Trp Pro Arg Val Ser Pro Ala Cys Ala Gly Arg Phe 
110 115 120 

Gin Ser Pro Val Asp lie Arg Pro Gin Leu Ala Ala Phe Cys Pro Ala 
125 130 135 

Leu Arg Pro Leu Glu Leu Leu Gly Phe Gin Leu Pro Pro Leu Pro Glu 
140 145 150 155 

Leu Arg Leu Arg Asn Asn Gly His Ser Val Gin Leu Thr Leu Pro Pro 

160 165 170 

Gly Leu Glu Met Ala Leu Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gin 
175 180 185 

Leu His Leu His Trp Gly Ala Ala Gly Arg Pro Gly Ser Glu His Thr 
190 195 200 

Val Glu Gly His Arg Phe Pro Ala Glu He His Val Val His Leu Ser 
205 210 215 

Thr Ala Phe Ala Arg Val Asp Glu Ala Leu Gly Arg Pro Gly Gly Leu 
220 225 230 235 

Ala Val Leu Ala Ala Phe Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala 

240 245 250 



Tyr Glu Gin Leu Leu Ser Arg Leu Glu Glu He Ala Glu Glu Gly Ser 
255 260 265 
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Glu Thr Gin Val 
270 

Phe Ser Arg Tyr 
285 

Ala Gin Gly Val 
300 

Ala Lys Gin Leu 



Ser Arg Leu Gin 
335 

Val He Glu Ala 
350 

Ala Glu Pro Val 
365 

Ala Leu Val Phe 
380 

Val Gin Met Arg 



Tyr Arg Pro Ala 
415 



Pro Gly Leu Asp 
275 

Phe Gin Tyr Glu 
290 

He Trp Thr Val 
305 

His Thr Leu Ser 
320 

Leu Asn Phe Arg 



Ser Phe Pro Ala 
355 

Gin Leu Asn Ser 
370 

Gly Leu Leu Phe 
385 

Arg Gin His Arg 
400 

Glu Val Ala Glu 



He Ser Ala Leu 



Gly Ser Leu Thr 
295 

Phe Asn Gin Thr 
310 

Asp Thr Leu Trp 
325 

Ala Thr Gin Pro 
340 

Gly Val Asp Ser 



Cys Leu Ala Ala 
375 

Ala Val Thr Ser 
390 

Arg Gly Thr Lys 
405 

Thr Gly Ala 
420 



Leu Pro Ser Asp 
280 

Thr Pro Pro Cys 



Val Met Leu Ser 
315 

Gly Pro Gly Asp 
330 

Leu Asn Gly Arg 
345 

Ser Pro Arg Ala 
360 

Gly Asp He Leu 



Val Ala Phe Leu 
395 

Gly Gly Val Ser 
410 



<210> 3 
<211> 29 
<212> DNA 
<213> HUMAN 

<400> 3 

cgcccagtgg gtcatcttcc ccagaagag 2 9 



<210> 4 

<211> 19 

<212> DNA 

<213> HUMAN 
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<400> 4 

ggaatcctcc tgcatccgg 19 



<210> 5 
<211> 10898 
<212> DNA 
<213> HUMAN 

<220> 

<221> gene 

<222> (1) . . (10898) 

<223> full-length MN genomic sequence 
<220> 

<22l> unsure of base at position 1974 
<222> (1974) 

<223> unsure of base at position 1974, which is in the 5' region 
flanking the transcription initiation site (3507) as 
determined by RNase protection assay. 

<400> 5 

ggatcctgtt gactcgtgac cttaccccca accctgtgct ctctgaaaca tgagctgtgt 60 
ccactcaggg ttaaatggat taagggcggt gcaagatgtg ctttgttaaa cagatgcttg 120 
aaggcagcat gctcgttaag agtcatcacc aatccctaat ctcaagtaat cagggacaca 180 
aacactgcgg aaggccgcag ggtcctctgc ctaggaaaac cagagacctt tgttcacttg 240 
tttatctgac cttccctcca ctattgtcca tgaccctgcc aaatccccct ctgtgagaaa 300 
cacccaagaa ttatcaataa aaaaataaat ttaaaaaaaa aatacaaaaa aaaaaaaaaa 3 60 
aaaaaaaaaa gacttacgaa tagttattga taaatgaata gctattggta aagccaagta 42 0 
aatgatcata ttcaaaacca gacggccatc atcacagctc aagtctacct gatttgatct 480 
ctttatcatt gtcattcttt ggattcacta gattagtcat catcctcaaa attctccccc 540 
aagttctaat tacgttccaa acatttaggg gttacatgaa gcttgaacct actaccttct 600 
ttgcttttga gccatgagtt gtaggaatga tgagtttaca ccttacatgc tggggattaa 660 
tttaaacttt acctctaagt cagttgggta gcctttggct tatttttgta gctaattttg 720 
tagttaatgg atgcactgtg aatcttgcta tgatagtttt cctccacact ttgccactag 780 
gggtaggtag gtactcagtt ttcagtaatt gcttacctaa gaccctaagc cctatttctc 840 
ttgtactggc ctttatctgt aatatgggca tatttaatac aatataattt ttggagtttt 900 
tttgtttgtt tgtttgtttg tttttttgag acggagtctt gcatctgtca tgcccaggct 960 
ggagtagcag tggtgccatc tcggctcact gcaagctcca cctcccgagt tcacgccatt 102 0 
ttcctgcctc agcctcccga gtagctggga ctacaggcgc ccgccaccat gcccggctaa 1080 
ttttttgtat ttttggtaga gacggggttt caccgtgtta gccagaatgg tctcgatctc 1140 
ctgacttcgt gatccacccg cctcggcctc ccaaagttct gggattacag gtgtgagcca 1200 
ccgcacctgg ccaatttttt gagtctttta aagtaaaaat atgtcttgta agctggtaac 1260 
tatggtacat ttccttttat taatgtggtg ctgacggtca tataggttct tttgagtttg 1320 
gcatgcatat gctacttttt gcagtccttt cattacattt ttctctcttc atttgaagag 1380 
catgttatat cttttagctt cacttggctt aaaaggttct ctcattagcc taacacagtg 144 0 
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tcattgttgg taccacttgg atcataagtg gaaaaacagt caagaaattg cacagtaata 1500 
cttgtttgta agagggatga ttcaggtgaa tctgacacta agaaactccc ctacctgagg 1560 
tctgagattc ctctgacatt gctgtatata ggcttttcct ttgacagcct gtgactgcgg 1620 
actatttttc ttaagcaaga tatgctaaag ttttgtgagc ctttttccag agagaggtct 1680 
catatctgca tcaagtgaga acatataatg tctgcatgtt tccatatttc aggaatgttt 1740 
gcttgtgttt tatgctttta tatagacagg gaaacttgtt cctcagtgac ccaaaagagg 1800 
tgggaattgt tattggatat catcattggc ccacgctttc tgaccttgga aacaattaag 1860 
ggttcataat ctcaattctg tcagaattgg tacaagaaat agctgctatg tttcttgaca 1920 
ttccacttgg taggaaataa gaatgtgaaa ctcttcagtt ggtgtgtgtc cctngttttt 1980 
ttgcaatttc cttcttactg tgttaaaaaa aagtatgatc ttgctctgag aggtgaggca 2 04 0 
ttcttaatca tgatctttaa agatcaataa tataatcctt tcaaggatta tgtctttatt 2100 
ataataaaga taatttgtct ttaacagaat caataatata atcccttaaa ggattatatc 2160 
tttgctgggc gcagtggctc acacctgtaa tcccagcact ttgggtggcc aaggtggaag 222 0 
gatcaaattt gcctacttct atattatctt ctaaagcaga attcatctct cttccctcaa 2280 
tatgatgata ttgacagggt ttgccctcac tcactagatt gtgagctcct gctcagggca 234 0 
ggtagcgttt tttgtttttg tttttgtttt tcttttttga gacagggtct tgctctgtca 2400 
cccaggccag agtgcaatgg tacagtctca gctcactgca gcctcaaccg cctcggctca 2460 
aaccatcatc ccatttcagc ctcctgagta gctgggacta caggcacatg ccattacacc 2 52 0 
tggctaattt ttttgtattt ctagtagaga cagggtttgg ccatgttgcc cgggctggtc 2580 
tcgaactcct ggactcaagc aatccaccca cctcagcctc ccaaaatgag ggaccgtgtc 264 0 
ttattcattt ccatgtccct agtccatagc ccagtgctgg acctatggta gtactaaata 2700 
aatatttgtt gaatgcaata gtaaatagca tttcagggag caagaactag attaacaaag 2 760 
gtggtaaaag gtttggagaa aaaaataata gtttaatttg gctagagtat gagggagagt 2 82 0 
agtaggagac aagatggaaa ggtctcttgg gcaaggtttt gaaggaagtt ggaagtcaga 2 880 
agtacacaat gtgcatatcg tggcaggcag tggggagcca atgaaggctt ttgagcagga 294 0 
gagtaatgtg ttgaaaaata aatataggtt aaacctatca gagcccctct gacacataca 3 000 
cttgcttttc attcaagctc aagtttgtct cccacatacc cattacttaa ctcaccctcg 3060 
ggctccccta gcagcctgcc ctacctcttt acctgcttcc tggtggagtc agggatgtat 312 0 
acatgagctg ctttccctct cagccagagg acatgggggg ccccagctcc cctgcctttc 318 0 
cccttctgtg cctggagctg ggaagcaggc cagggttagc tgaggctggc tggcaagcag 3240 
ctgggtggtg ccagggagag cctgcatagt gccaggtggt gccttgggtt ccaagctagt 33 00 
ccatggcccc gataaccttc tgcctgtgca cacacctgcc cctcactcca cccccatcct 3360 
agctttggta tgggggagag ggcacagggc cagacaaacc tgtgagactt tggctccatc 342 0 
tctgcaaaag ggcgctctgt gagtcagcct gctcccctcc aggcttgctc ctcccccacc 348 0 
cagctctcgt ttccaatgca cgtacagccc gtacacaccg tgtgctggga caccccacag 3 54 0 
tcagccgcat ggctcccctg tgccccagcc cctggctccc tctgttgatc ccggcccctg 3 600 
ctccaggcct cactgtgcaa ctgctgctgt cactgctgct tctggtgcct gtccatcccc 3660 
agaggttgcc ccggatgcag gaggattccc ccttgggagg aggctcttct ggggaagatg 372 0 
acccactggg cgaggaggat ctgcccagtg aagaggattc acccagagag gaggatccac 3780 
ccggagagga ggatctacct ggagaggagg atctacctgg agaggaggat ctacctgaag 384 0 
ttaagcctaa atcagaagaa gagggctccc tgaagttaga ggatctacct actgttgagg 3 900 
ctcctggaga tcctcaagaa ccccagaata atgcccacag ggacaaagaa ggtaagtggt 3 960 
catcaatctc caaatccagg ttccaggagg ttcatgactc ccctcccata ccccagccta 4020 
ggctctgttc actcagggaa ggaggggaga ctgtactccc cacagaagcc cttccagagg 4080 
tcccatacca atatccccat ccccactctc ggaggtagaa agggacagat gtggagagaa 4140 
aataaaaagg gtgcaaaagg agagaggtga gctggatgag atgggagaga agggggaggc 4200 
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tggagaagag aaagggatga gaactgcaga tgagagaaaa aatgtgcaga cagaggaaaa 4260 
aaataggtgg agaaggagag tcagagagtt tgaggggaag agaaaaggaa agcttgggag 432 0 
gtgaagtggg taccagagac aagcaagaag agctggtaga agtcatctca tcttaggcta 4380 
caatgaggaa ttgagaccta ggaagaaggg acacagcagg tagagaaacg tggcttcttg 4440 
actcccaagc caggaatttg gggaaagggg ttggagacca tacaaggcag agggatgagt 4500 
ggggagaaga aagaagggag aaaggaaaga tggtgtactc actcatttgg gactcaggac 4560 
tgaagtgccc actcactttt tttttttttt tttttgagac aaactttcac ttttgttgcc 4620 
caggctggag tgcaatggcg cgatctcggc tcactgcaac ctccacctcc cgggttcaag 4 680 
tgattctcct gcctcagcct ctagccaagt agctgcgatt acaggcatgc gccaccacgc 4740 
ccggctaatt tttgtatttt tagtagagac ggggtttcgc catgttggtc aggctggtct 4800 
cgaactcctg atctcaggtg atccaaccac cctggcctcc caaagtgctg ggattatagg 4860 
cgtgagccac agcgcctggc ctgaagcagc cactcacttt tacagaccct aagacaatga 4 92 0 
ttgcaagctg gtaggattgc tgtttggccc acccagctgc ggtgttgagt ttgggtgcgg 4 980 
tctcctgtgc tttgcacctg gcccgcttaa ggcatttgtt acccgtaatg ctcctgtaag 5040 
gcatctgcgt ttgtgacatc gttttggtcg ccaggaaggg attggggctc taagcttgag 5100 
cggttcatcc ttttcattta tacaggggat gaccagagtc attggcgcta tggaggtgag 5160 
acacccaccc gctgcacaga cccaatctgg gaacccagct ctgtggatct cccctacagc 5220 
cgtccctgaa cactggtccc gggcgtccca cccgccgccc accgtcccac cccctcacct 52 80 
tttctacccg ggttccctaa gttcctgacc taggcgtcag acttcctcac tatactctcc 5340 
caccccaggc gacccgccct ggccccgggt gtccccagcc tgcgcgggcc gcttccagtc 5400 
cccggtggat atccgccccc agctcgccgc cttctgcccg gccctgcgcc ccctggaact 5460 
cctgggcttc cagctcccgc cgctcccaga actgcgcctg cgcaacaatg gccacagtgg 5520 
tgagggggtc tccccgccga gacttgggga tggggcgggg cgcagggaag ggaaccgtcg 5580 
cgcagtgcct gcccgggggt tgggctggcc ctaccgggcg gggccggctc acttgcctct 564 0 
ccctacgcag tgcaactgac cctgcctcct gggctagaga tggctctggg tcccgggcgg 5700 
gagtaccggg ctctgcagct gcatctgcac tggggggctg caggtcgtcc gggctcggag 5760 
cacactgtgg aaggccaccg tttccctgcc gaggtgagcg cggactggcc gagaaggggc 582 0 
aaaggagcgg ggcggacggg ggccagagac gtggccctct cctaccctcg tgtccttttc 5880 
agatccacgt ggttcacctc agcaccgcct ttgccagagt tgacgaggcc ttggggcgcc 5940 
cgggaggcct ggccgtgttg gccgcctttc tggaggtacc agatcctgga caccccctac 6000 
tccccgcttt cccatcccat gctcctcccg gactctatcg tggagccaga gaccccatcc 6060 
cagcaagctc actcaggccc ctggctgaca aactcattca cgcactgttt gttcatttaa 6120 
cacccactgt gaaccaggca ccagccccca acaaggattc tgaagctgta ggtccttgcc 6180 
tctaaggagc ccacagccag tgggggaggc tgacatgaca gacacatagg aaggacatag 6240 
taaagatggt ggtcacagag gaggtgacac ttaaagcctt cactggtaga aaagaaaagg 63 00 
aggtgttcat tgcagaggaa acagaatgtg caaagactca gaatatggcc tatttaggga 6360 
atggctacat acaccatgat tagaggaggc ccagtaaagg gaagggatgg tgagatgcct 6420 
gctaggttca ctcactcact tttatttatt tatttatttt tttgacagtc tctctgtcgc 6480 
ccaggctgga gtgcagtggt gtgatcttgg gtcactgcaa cttccgcctc ccgggttcaa 654 0 
gggattctcc tgcctcagct tcctgagtag ctggggttac aggtgtgtgc caccatgccc 6600 
agctaatttt tttttgtatt tttagtagac agggtttcac catgttggtc aggctggtct 6660 
caaactcctg gcctcaagtg atccgcctga ctcagcctac caaagtgctg attacaagtg 6720 
tgagccaccg tgcccagcca cactcactga ttctttaatg ccagccacac agcacaaagt 6780 
tcagagaaat gcctccatca tagcatgtca atatgttcat actcttaggt tcatgatgtt 6840 
cttaacatta ggttcataag caaaataaga aaaaagaata ataaataaaa gaagtggcat 6900 
gtcaggacct cacctgaaaa gccaaacaca gaatcatgaa ggtgaatgca gaggtgacac 6960 
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caacacaaag gtgtatatat ggtttcctgt ggggagtatg tacggaggca gcagtgagtg 7020 
agactgcaaa cgtcagaagg gcacgggtca ctgagagcct agtatcctag taaagtgggc 7080 
tctctccctc tctctccagc ttgtcattga aaaccagtcc accaagcttg ttggttcgca 7140 
cagcaagagt acatagagtt tgaaataata cataggattt taagagggag acactgtctc 72 00 
taaaaaaaaa aacaacagca acaacaaaaa gcaacaacca ttacaatttt atgttccctc 7260 
agcattctca gagctgagga atgggagagg actatgggaa cccccttcat gttccggcct 732 0 
tcagccatgg ccctggatac atgcactcat ctgtcttaca atgtcattcc cccaggaggg 7380 
cccggaagaa aacagtgcct atgagcagtt gctgtctcgc ttggaagaaa tcgctgagga 7440 
aggtcagttt gttggtctgg ccactaatct ctgtggccta gttcataaag aatcaccctt 7500 
tggagcttca ggtctgaggc tggagatggg ctccctccag tgcaggaggg attgaagcat 7560 
gagccagcgc tcatcttgat aataaccatg aagctgacag acacagttac ccgcaaacgg 762 0 
ctgcctacag attgaaaacc aagcaaaaac cgccgggcac ggtggctcac gcctgtaatc 7680 
ccagcacttt gggaggccaa ggcaggtgga tcacgaggtc aagagatcaa gaccatcctg 774 0 
gccaacatgg tgaaacccca tctctactaa aaatacgaaa aaatagccag gcgtggtggc 7800 
gggtgcctgt aatcccagct actcgggagg ctgaggcagg agaatggcat gaacccggga 7860 
ggcagaagtt gcagtgagcc gagatcgtgc cactgcactc cagcctgggc aacagagcga 792 0 
gactcttgtc tcaaaaaaaa aaaaaaaaaa gaaaaccaag caaaaaccaa aatgagacaa 7980 
aaaaaacaag accaaaaaat ggtgtttgga aattgtcaag gtcaagtctg gagagctaaa 804 0 
ctttttctga gaactgttta tctttaataa gcatcaaata ttttaacttt gtaaatactt 8100 
ttgttggaaa tcgttctctt cttagtcact cttgggtcat tttaaatctc acttactcta 8160 
ctagaccttt taggtttctg ctagactagg tagaactctg cctttgcatt tcttgtgtct 8220 
gttttgtata gttatcaata ttcatattta tttacaagtt attcagatca ttttttcttt 8280 
tctttttttt tttttttttt ttttttacat ctttagtaga gacagggttt caccatattg 8340 
gccaggctgc tctcaaactc ctgaccttgt gatccaccag cctcggcctc ccaaagtgct 8400 
gggattcatt ttttcttttt aatttgctct gggcttaaac ttgtggccca gcactttatg 8460 
atggtacaca gagttaagag tgtagactca gacggtcttt cttctttcct tctcttcctt 8520 
cctcccttcc ctcccacctt cccttctctc cttcctttct ttcttcctct cttgcttcct 8580 
caggcctctt ccagttgctc caaagccctg tacttttttt tgagttaacg tcttatggga 8640 
agggcctgca cttagtgaag aagtggtctc agagttgagt taccttggct tctgggaggt 8700 
gaaactgtat ccctataccc tgaagcttta agggggtgca atgtagatga gaccccaaca 8760 
tagatcctct tcacaggctc agagactcag gtcccaggac tggacatatc tgcactcctg 8820 
ccctctgact tcagccgcta cttccaatat gaggggtctc tgactacacc gccctgtgcc 8880 
cagggtgtca tctggactgt gtttaaccag acagtgatgc tgagtgctaa gcaggtgggc 8940 
ctggggtgtg tgtggacaca gtgggtgcgg gggaaagagg atgtaagatg agatgagaaa 9000 
caggagaaga aagaaatcaa ggctgggctc tgtggcttac gcctataatc ccaccacgtt 9060 
gggaggctga ggtgggagaa tggtttgagc ccaggagttc aagacaaggc ggggcaacat 912 0 
agtgtgaccc catctctacc aaaaaaaccc caacaaaacc aaaaatagcc gggcatggtg 9180 
gtatgcggcc tagtcccagc tactcaagga ggctgaggtg ggaagatcgc ttgattccag 924 0 
gagtttgaga ctgcagtgag ctatgatccc accactgcct accatcttta ggatacattt 9300 
atttatttat aaaagaaatc aagaggctgg atggggaata caggagctgg agggtggagc 9360 
cctgaggtgc tggttgtgag ctggcctggg acccttgttt cctgtcatgc catgaaccca 9420 
cccacactgt ccactgacct ccctagctcc acaccctctc tgacaccctg tggggacctg 9480 
gtgactctcg gctacagctg aacttccgag cgacgcagcc tttgaatggg cgagtgattg 9540 
aggcctcctt ccctgctgga gtggacagca gtcctcgggc tgctgagcca ggtacagctt 9600 
tgtctggttt ccccccagcc agtagtccct tatcctccca tgtgtgtgcc agtgtctgtc 9660 
attggtggtc acagcccgcc tctcacatct cctttttctc tccagtccag ctgaattcct 9720 



10 



gcctggctgc tggtgagtct gcccctcctc 
ccattcagcc ccagggctgc tcaggaccgc 
accccaaccc caatattaga gaggcagatc 
gctaattgat tagaatgaag cttgagaaat 
cccccctttt tttaaagata gggtctcact 
cgatcatagc tcactgcagc ctcgaactcc 
ctcaaagcac tgggactgta ggcatgagcc 
ttggctttta ggaagcaaaa acggtgctta 
cccttggctg gcctcttctg gagactgagg 
ggggtggtgg agtgcactga ggcaggtgtt 
aaagcagccc tctctgctct ccatcgcagg 
ttttgctgtc accagcgtcg cgttccttgt 
tgaccctttc ttcaggcaca agcttccccc 
atgcaaatga gctgctcctg ggccagtttt 
gaaggggaac caaagggggt gtgagctacc 
agaggctgga tcttggagaa tgtgagaagc 
actgtcctgt cctgctcatt atgccacttc 
aatatttata ataaaatatg tgttagtcac 
gaatttccta ttactgttat tagcaccaat 
tcggcctcct tccacacatc actccaatgt 



ttggtcctga tgccaggaga ctcctcagca 9780 
ctctgctccc tctccttttc tgcagaacag 9840 
atggtgggga ttcccccatt gtccccagag 9900 
ctcccagcat ccctctcgca aaagaatccc 9960 
ctgtttgccc caggctgggg tgttgtggca 10020 
taggctcagg caatcctttc accttagctt 10080 
actgtgcctg gccccaaacg gcccttttac 10140 
tcttacccct tctcgtgtat ccaccctcat 10200 
cactatgggg ctgcctgaga actcggggca 10260 
gaggaactct gcagacccct cttccttccc 10320 
tgacatccta gccctggttt ttggcctcct 10380 
gcagatgaga aggcagcaca ggtattacac 1044 0 
acccttgtgg agtcacttca tgcaaagcgc 10500 
ctgattagcc tttcctgttg tgtacacaca 10560 
gcccagcaga ggtagccgag actggagcct 10620 
cagccagagg catctgaggg ggagccggta 10680 
cttttaactg ccaagaaatt ttttaaaata 10740 
ctttgttccc caaatcagaa ggaggtattt 10800 
ttagtggtaa tgcatttatt ctattacagt 10860 
gttgctcc 10898 



<210> 6 
<211> 37 
<212> PRT 
<213> HUMAN 

<400> 6 

Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu lie Pro Ala 
15 10 15 

Pro Ala Pro Gly Leu Thr Val Gin Leu Leu Leu Ser Leu Leu Leu Leu 
20 25 30 

Met Pro Val His Pro 
35 



<210> 7 
<211> 25 
<212> DNA 
<213> HUMAN 



<400> 7 

tggggttctt gaggatctcc aggag 



25 



<210> 8 
<211> 26 
<212> DNA 
<213> HUMAN 

<400> 8 

ctctaacttc agggagccct cttctt 2 6 



<210> 9 
<211> 48 
<212> DNA 
<213> HUMAN 

<220> 

<221> primer_bind 
<222> (1) . . (48) 

<223> anchor primer that anneals to the homopolymeric tail. 
<220> 

<221> inosine 

<222> (36).. (37) (41).. (42) (46) . . (47) 

<223> each of the modif ied_bases at positions (36) , (37), (41), (42), (46) 
and (47) are inosine 

<400> 9 

cuacuacuac uaggccacgc gtcgactagt acgggaaggg aagggaag 48 

<210> 10 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 10 

Glu Glu Asp Leu Pro Ser 
1 5 



<210> 11 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 11 



12 



Gly Glu Asp Asp Pro Leu 
1 5 



<210> 12 
<211> 21 
<212> PRT 
<213> HUMAN 

<400> 12 

Asn Asn Ala His Arg Asp Lys Glu Gly Asp Asp Gin Ser His Trp Arg 
15 10 15 

Tyr Gly Gly Asp Pro 
20 



<210> 13 
<211> 16 
<212> PRT 
<213> HUMAN 

<400> 13 

His Pro Gin Arg Leu Pro Arg Met Gin Glu Asp Ser Pro Leu Gly Gly 
15 10 15 



<210> 14 
<211> 24 
<212> PRT 
<213> HUMAN 

<400> 14 

Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu Glu Asp Leu 
15 10 15 

Pro Gly Glu Glu Asp Leu Pro Gly 
20 



<210> 15 
<211> 13 
<212> PRT 
<213> HUMAN 

<400> 15 
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Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala Tyr Glu Gin 
15 10 



<210> 16 
<211> 16 
<212> PRT 
<213> HUMAN 

<400> 16 

Met Arg Arg Gin His Arg Arg Gly Thr Lys Gly Gly Val Ser Tyr Arg 
15 10 15 



<210> 17 
<211> 45 
<212> DNA 
<213> HUMAN 



<400> 17 

gtcgctagct ccatgggtca tatgcagagg ttgccccgga tgcag 



<210> 18 
<211> 43 
<212> DNA 
<213> HUMAN 



<400> 18 

gaagatctct tactcgagca ttctccaaga tccagcctct agg 



<210> 19 

<211> 10 

<212> DNA 

<213> HUMAN 

<400> 19 
ctccatctct 



<210> 20 
<211> 10 
<212> DNA 
<213> HUMAN 
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<400> 20 

ccacccccat 10 



<210> 21 

<211> 205 

<212> DNA 

<213> HUMAN 

<400> 21 

acctgcccct cactccaccc ccatcctagc 

acaaacctgt gagactttgg ctccatctct 

cccctccagg cttgctcctc ccccacccag 

cacaccgtgt gctgggacac cccac 



tttggtatgg gggagagggc acagggccag 60 
gcaaaagggc gctctgtgag tcagcctgct 12 0 
ctctcgtttc caatgcacgt acagcccgta 180 

205 



<210> 22 
<211> 8 
<212> PRT 
<213> HUMAN 

<400> 22 

Leu Glu His His His His His His 

1 . 5 



<210> 23 
<211> 10 
<212> DNA 
<213> HUMAN 

<220> 

<221> misc_f eature 
<222> (1) . . (10) 

<400> 23 

yyycayyyyy 10 



<210> 24 

<211> 10 

<212> DNA 

<213> HUMAN 

<300> 

<3 01> Locker and Buzard, 
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<303> DNA Sequencing and Mapping 

<304> 1 

<306> 3-11 

<307> 1990 

<400> 24 

tgtgagactt 10 



<210> 25 
<211> 4 
<212> PRT 
<213> HUMAN 

<220> 

<221> SITE 
<222> (1) . . (4) 

<223> sequence element defined by Suzuki, J. Mol . Biol . , 207 : 61-84 
(1989) as motif frequently found in gene regulatory proteins. 



<220> 

<221> VARIANTS 
<222> (3) . . (4) 

<223> variants in sequence element defined by Suzuki, J. Mol . Biol . , 
207 : 61-84 (1989) as motif frequently found in gene regulatory 
proteins . 

<400> 25 
Ser Pro Xaa Xaa 
1 



<210> 26 

<211> 4 

<212> PRT 

<213> HUMAN 

<220> 

<221> SITE 

<222> (1) . . (4) 

<223> sequence element defined by Suzuki, J. Mol . Biol . , 207 : 61-84 
(1989) as a motif frequently found in gene regulatory proteins. 

<220> 

<221> VARIANTS 
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<222> (3) . . (4) 

<223> variants in sequence element defined by Suzuki, J. Mol . Biol . . 
207 : 61-84 (1989) as a motif frequently found in gene 
regulatory proteins . 

<400> 26 
Thr Pro Xaa Xaa 
1 



<210> 27 
<211> 540 
<212> DNA 
<213> HUMAN 

<220> 

<221> promoter 
<222> (1) . . (540) 



<400> 27 

cttgcttttc attcaagctc aagtttgtct cccacatacc cattacttaa ctcaccctcg 60 

ggctccccta gcagcctgcc ctacctcttt acctgcttcc tggtggagtc agggatgtat 12 0 

acatgagctg ctttccctct cagccagagg acatgggggg ccccagctcc cctgcctttc 180 

cccttctgtg cctggagctg ggaagcaggc cagggttagc tgaggctggc tggcaagcag 240 

ctgggtggtg ccagggagag cctgcatagt gccaggtggt gccttgggtt ccaagctagt 3 00 

ccatggcccc gataaccttc tgcctgtgca cacacctgcc cctcactcca cccccatcct 360 

agctttggta tgggggagag ggcacagggc cagacaaacc tgtgagactt tggctccatc 42 0 

tctgcaaaag ggcgctctgt gagtcagcct gctcccctcc aggcttgctc ctcccccacc 480 

cagctctcgt ttccaatgca cgtacagccc gtacacaccg tgtgctggga caccccacag 54 0 



<210> 28 
<211> 445 
<212> DNA 
<213> HUMAN 

<220> 

<22l> exon 
<222> (1) 

<223> 1st MN exon 
<400> 28 

gcccgtacac accgtgtgct gggacacccc acagtcagcc gcatggctcc cctgtgcccc 60 
agcccctggc tccctctgtt gatcccggcc cctgctccag gcctcactgt gcaactgctg 12 0 
ctgtcactgc tgcttctggt gcctgtccat ccccagaggt tgccccggat gcaggaggat 180 
tcccccttgg gaggaggctc ttctggggaa gatgacccac tgggcgagga ggatctgccc 24 0 
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agtgaagagg attcacccag agaggaggat 
gaggatctac ctggagagga ggatctacct 
tccctgaagt tagaggatct acctactgtt 
aataatgccc acagggacaa agaag 



ccacccggag aggaggatct acctggagag 3 00 
gaagttaagc ctaaatcaga agaagagggc 360 
gaggctcctg gagatcctca agaaccccag 42 0 

445 



<210> 29 
<211> 30 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<22 3> 2nd MN exon 
<400> 29 

gggatgacca gagtcattgg cgctatggag 3 0 



<210> 30 
<211> 171 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<223> 3rd MN exon 
<400> 30 

gcgacccgcc ctggccccgg gtgtccccag 
atatccgccc ccagctcgcc gccttctgcc 
tccagctccc gccgctccca gaactgcgcc 



cctgcgcggg ccgcttccag tccccggtgg 60 
cggccctgcg ccccctggaa ctcctgggct 120 
tgcgcaacaa tggccacagt g 171 



<210> 31 
<211> 143 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<223> 4th MN exon 
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<400> 31 

tgcaactgac cctgcctcct gggctagaga 

ctctgcagct gcatctgcac tggggggctg 

aaggccaccg tttccctgcc gag 



tggctctggg tcccgggcgg gagtaccggg 60 
caggtcgtcc gggctcggag cacactgtgg 12 0 

143 



<210> 32 
<211> 93 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<223> 5th MN exon 
<400> 32 

atccacgtgg ttcacctcag caccgccttt gccagagttg acgaggcctt ggggcgcccg 60 
ggaggcctgg ccgtgttggc cgcctttctg gag 93 



<210> 33 
<211> 67 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<223> 6th MN exon 
<400> 33 

gagggcccgg aagaaaacag tgcctatgag 
gaggaag 



cagttgctgt ctcgcttgga agaaatcgct 60 

67 



<210> 34 
<211> 158 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 

<222> (1) 

<22 3> 7th MN exon 
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<400> 34 

gctcagagac tcaggtccca ggactggaca 
gctacttcca atatgagggg tctctgacta 
ctgtgtttaa ccagacagtg atgctgagtg 



tatctgcact cctgccctct gacttcagcc 60 
caccgccctg tgcccagggt gtcatctgga 12 0 
ctaagcag 158 



<210> 35 
<211> 145 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 

<222> (1) 

<223> 8th MN exon 

<400> 35 

ctccacaccc tctctgacac cctgtgggga 
cgagcgacgc agcctttgaa tgggcgagtg 
agcagtcctc gggctgctga gccag 



cctggtgact ctcggctaca gctgaacttc 60 
attgaggcct ccttccctgc tggagtggac 120 

145 



<210> 36 
<211> 27 
<212> DNA 
<213> HUMAN 

<220> 

<221> exon 
<222> (1) 

<223> 9th MN exon 
<400> 36 

tccagctgaa ttcctgcctg gctgctg 27 



<210> 37 

<211> 82 

<212> DNA 

<213> HUMAN 

<220> 

<221> exon 

<222> (1) 

<223> 10th MN exon 



20 



<400> 37 

gtgacatcct agccctggtt tttggcctcc tttttgctgt caccagcgtc gcgttccttg 60 
tgcagatgag aaggcagcac ag 82 



<210> 38 

<211> 191 

<212> DNA 

<213> HUMAN 



<220> 

<221> exon 
<222> (1) 

<223> 11th MN exon 



<400> 38 

aaggggaacc aaagggggtg tgagctaccg 
gaggctggat cttggagaat gtgagaagcc 
ctgtcctgtc ctgctcatta tgccacttcc 
atatttataa t 



cccagcagag gtagccgaga ctggagccta 60 
agccagaggc atctgagggg gagccggtaa 120 
ttttaactgc caagaaattt tttaaaataa 180 

191 



<210> 39 
<211> 1174 
<212> DNA 
<213> HUMAN 



<220> 

<221> intron 
<222> (1) . . (1174) 
<223> 1st MN intron 



<400> 39 

gtaagtggtc atcaatctcc aaatccaggt tccaggaggt tcatgactcc cctcccatac 60 
cccagcctag gctctgttca ctcagggaag gaggggagac tgtactcccc acagaagccc 12 0 
ttccagaggt cccataccaa tatccccatc cccactctcg gaggtagaaa gggacagatg 180 
tggagagaaa ataaaaaggg tgcaaaagga gagaggtgag ctggatgaga tgggagagaa 24 0 
gggggaggct ggagaagaga aagggatgag aactgcagat gagagaaaaa atgtgcagac 3 00 
agaggaaaaa aataggtgga gaaggagagt cagagagttt gaggggaaga gaaaaggaaa 360 
gcttgggagg tgaagtgggt accagagaca agcaagaaga gctggtagaa gtcatctcat 42 0 
cttaggctac aatgaggaat tgagacctag gaagaaggga cacagcaggt agagaaacgt 480 
ggcttcttga ctcccaagcc aggaatttgg ggaaaggggt tggagaccat acaaggcaga 540 
gggatgagtg gggagaagaa agaagggaga aaggaaagat ggtgtactca ctcatttggg 600 
actcaggact gaagtgccca ctcacttttt tttttttttt ttttgagaca aactttcact 660 
tttgttgccc aggctggagt gcaatggcgc gatctcggct cactgcaacc tccacctccc 72 0 
gggttcaagt gattctcctg cctcagcctc tagccaagta gctgcgatta caggcatgcg 780 
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ccaccacgcc cggctaattt ttgtattttt 
ggctggtctc gaactcctga tctcaggtga 
gattataggc gtgagccaca gcgcctggcc 
agacaatgat tgcaagctgg taggattgct 
tgggtgcggt ctcctgtgct ttgcacctgg 
tcctgtaagg catctgcgtt tgtgacatcg 
aagcttgagc ggttcatcct tttcatttat 



agtagagacg gggtttcgcc atgttggtca 84 0 
tccaaccacc ctggcctccc aaagtgctgg 90 0 
tgaagcagcc actcactttt acagacccta 960 
gtttggccca cccagctgcg gtgttgagtt 102 0 
cccgcttaag gcatttgtta cccgtaatgc 1080 
ttttggtcgc caggaaggga ttggggctct 114 0 
acag 1174 



<210> 40 
<211> 193 
<212> DNA 
<213> HUMAN 



<220> 

<221> intron 
<222> (1) . . (193) 
<223> 2nd MN intron 



<400> 40 

gtgagacacc cacccgctgc acagacccaa 
acagccgtcc ctgaacactg gtcccgggcg 
caccttttct acccgggttc cctaagttcc 
tctcccaccc cag 



tctgggaacc cagctctgtg gatctcccct 60 
tcccacccgc cgcccaccgt cccaccccct 120 
tgacctaggc gtcagacttc ctcactatac 180 

193 



<210> 41 
<211> 131 
<212> DNA 
<213> HUMAN 



<220> 

<22l> intron 
<222> (1) . . (131) 
<22 3> 3rd MN intron 



<400> 41 

gtgagggggt ctccccgccg agacttgggg 
gcgcagtgcc tgcccggggg ttgggctggc 
tccctacgca g 



atggggcggg gcgcagggaa gggaaccgtc 60 
cctaccgggc ggggccggct cacttgcctc 12 0 

131 



<210> 42 
<211> 89 
<212> DNA 
<213> HUMAN 
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<220> 

<22l> intron 
<222> (1) . . (89) 
<223> 4th MN intron 



<400> 42 

gtgagcgcgg actggccgag aaggggcaaa ggagcggggc ggacgggggc cagagacgtg 60 
gccctctcct accctcgtgt ccttttcag 89 



<210> 43 
<211> 1400 
<212> DNA 
<213> HUMAN 



<220> 

<221> intron 
<222> (1) . . (1400) 
<223> 5th MN intron 



<400> 43 

gtaccagatc ctggacaccc cctactcccc gctttcccat cccatgctcc tcccggactc 60 
tatcgtggag ccagagaccc catcccagca agctcactca ggcccctggc tgacaaactc 12 0 
attcacgcac tgtttgttca tttaacaccc actgtgaacc aggcaccagc ccccaacaag 180 
gattctgaag ctgtaggtcc ttgcctctaa ggagcccaca gccagtgggg gaggctgaca 240 
tgacagacac ataggaagga catagtaaag atggtggtca cagaggaggt gacacttaaa 3 00 
gccttcactg gtagaaaaga aaaggaggtg ttcattgcag aggaaacaga atgtgcaaag 3 60 
actcagaata tggcctattt agggaatggc tacatacacc atgattagag gaggcccagt 420 
aaagggaagg gatggtgaga tgcctgctag gttcactcac tcacttttat ttatttattt 480 
atttttttga cagtctctct gtcgcccagg ctggagtgca gtggtgtgat cttgggtcac 540 
tgcaacttcc gcctcccggg ttcaagggat tctcctgcct cagcttcctg agtagctggg 600 
gttacaggtg tgtgccacca tgcccagcta attttttttt gtatttttag tagacagggt 660 
ttcaccatgt tggtcaggct ggtctcaaac tcctggcctc aagtgatccg cctgactcag 720 
cctaccaaag tgctgattac aagtgtgagc caccgtgccc agccacactc actgattctt 780 
taatgccagc cacacagcac aaagttcaga gaaatgcctc catcatagca tgtcaatatg 840 
ttcatactct taggttcatg atgttcttaa cattaggttc ataagcaaaa taagaaaaaa 900 
gaataataaa taaaagaagt ggcatgtcag gacctcacct gaaaagccaa acacagaatc 960 
atgaaggtga atgcagaggt gacaccaaca caaaggtgta tatatggttt cctgtgggga 102 0 
gtatgtacgg aggcagcagt gagtgagact gcaaacgtca gaagggcacg ggtcactgag 108 0 
agcctagtat cctagtaaag tgggctctct ccctctctct ccagcttgtc attgaaaacc 114 0 
agtccaccaa gcttgttggt tcgcacagca agagtacata gagtttgaaa taatacatag 1200 
gattttaaga gggagacact gtctctaaaa aaaaaaacaa cagcaacaac aaaaagcaac 1260 
aaccattaca attttatgtt ccctcagcat tctcagagct gaggaatggg agaggactat 132 0 
gggaaccccc ttcatgttcc ggccttcagc catggccctg gatacatgca ctcatctgtc 1380 
ttacaatgtc attcccccag 1400 
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<210> 44 
<211> 1334 
<212> DNA 
<213> HUMAN 



<220> 

<221> intron 
<222> (1) . . (1334) 
<223> 6th MN intron 



<400> 44 

gtcagtttgt tggtctggcc actaatctct gtggcctagt tcataaagaa tcaccctttg 60 

gagcttcagg tctgaggctg gagatgggct ccctccagtg caggagggat tgaagcatga 120 

gccagcgctc atcttgataa taaccatgaa gctgacagac acagttaccc gcaaacggct 180 

gcctacagat tgaaaaccaa gcaaaaaccg ccgggcacgg tggctcacgc ctgtaatccc 240 

agcactttgg gaggccaagg caggtggatc acgaggtcaa gagatcaaga ccatcctggc 3 00 

caacatggtg aaaccccatc tctactaaaa atacgaaaaa atagccaggc gtggtggcgg 3 60 

gtgcctgtaa tcccagctac tcgggaggct gaggcaggag aatggcatga acccgggagg 420 

cagaagttgc agtgagccga gatcgtgcca ctgcactcca gcctgggcaa cagagcgaga 480 

ctcttgtctc aaaaaaaaaa aaaaaaaaga aaaccaagca aaaaccaaaa tgagacaaaa 540 

aaaacaagac caaaaaatgg tgtttggaaa ttgtcaaggt caagtctgga gagctaaact 600 

ttttctgaga actgtttatc tttaataagc atcaaatatt ttaactttgt aaatactttt 660 

gttggaaatc gttctcttct tagtcactct tgggtcattt taaatctcac ttactctact 720 

agacctttta ggtttctgct agactaggta gaactctgcc tttgcatttc ttgtgtctgt 780 

tttgtatagt tatcaatatt catatttatt tacaagttat tcagatcatt ttttcttttc 840 

tttttttttt tttttttttt ttttacatct ttagtagaga cagggtttca ccatattggc 900 

caggctgctc tcaaactcct gaccttgtga tccaccagcc tcggcctccc aaagtgctgg 960 

gattcatttt ttctttttaa tttgctctgg gcttaaactt gtggcccagc actttatgat 1020 

ggtacacaga gttaagagtg tagactcaga cggtctttct tctttccttc tcttccttcc 1080 

tcccttccct cccaccttcc cttctctcct tcctttcttt cttcctctct tgcttcctca 1140 

ggcctcttcc agttgctcca aagccctgta cttttttttg agttaacgtc ttatgggaag 1200 

ggcctgcact tagtgaagaa gtggtctcag agttgagtta ccttggcttc tgggaggtga 12 60 

aactgtatcc ctataccctg aagctttaag ggggtgcaat gtagatgaga ccccaacata 1320 
gatcctcttc acag 1334 



<210> 45 

<211> 512 

<212> DNA 

<213> HUMAN 



<220> 

<221> intron 
<222> (1) . . (512) 
<223> 7th MN intron 



24 



<400> 45 

gtgggcctgg ggtgtgtgtg gacacagtgg gtgcggggga aagaggatgt aagatgagat 60 
gagaaacagg agaagaaaga aatcaaggct gggctctgtg gcttacgcct ataatcccac 12 0 
cacgttggga ggctgaggtg ggagaatggt ttgagcccag gagttcaaga caaggcgggg 18 0 
caacatagtg tgaccccatc tctaccaaaa aaaccccaac aaaaccaaaa atagccgggc 24 0 
atggtggtat gcggcctagt cccagctact caaggaggct gaggtgggaa gatcgcttga 30 0 
ttccaggagt ttgagactgc agtgagctat gatcccacca ctgcctacca tctttaggat 3 60 
acatttattt atttataaaa gaaatcaaga ggctggatgg ggaatacagg agctggaggg 42 0 
tggagccctg aggtgctggt tgtgagctgg cctgggaccc ttgtttcctg tcatgccatg 480 
aacccaccca cactgtccac tgacctccct ag 512 



<210> 46 
<211> 114 
<212> DNA 
<213> HUMAN 

<220> 

<221> intron 
<222> (1) . . (114) 
<223> 8th MN intron 

<400> 46 

gtacagcttt gtctggtttc cccccagcca gtagtccctt atcctcccat gtgtgtgcca 60 
gtgtctgtca ttggtggtca cagcccgcct ctcacatctc ctttttctct ccag 114 



<210> 47 

<211> 617 

<212> DNA 

<213> HUMAN 



<220> 

<221> intron 
<222> (1) . . (617) 
<223> 9th MN intron 



<400> 47 

gtgagtctgc ccctcctctt ggtcctgatg ccaggagact cctcagcacc attcagcccc 60 
agggctgctc -aggaccgcct ctgctccctc tccttttctg cagaacagac cccaacccca 12 0 
atattagaga ggcagatcat ggtggggatt cccccattgt ccccagaggc taattgatta 180 
gaatgaagct tgagaaatct cccagcatcc ctctcgcaaa agaatccccc cccctttttt 240 
taaagatagg gtctcactct gtttgcccca ggctggggtg ttgtggcacg atcatagctc 300 
actgcagcct cgaactccta ggctcaggca atcctttcac cttagcttct caaagcactg 360 
ggactgtagg catgagccac tgtgcctggc cccaaacggc ccttttactt ggcttttagg 42 0 
aagcaaaaac ggtgcttatc ttaccccttc tcgtgtatcc accctcatcc cttggctggc 480 
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ctcttctgga gactgaggca ctatggggct gcctgagaac tcggggcagg ggtggtggag 54 0 
tgcactgagg caggtgttga ggaactctgc agacccctct tccttcccaa agcagccctc 600 
tctgctctcc atcgcag 617 



<210> 48 

<211> 130 

<212> DNA 

<213> HUMAN 



<220> 

<221> intron 

<222> (1) . . (130) 

<223> 10th MN intron 



<400> 48 

gtattacact gaccctttct tcaggcacaa 
gcaaagcgca tgcaaatgag ctgctcctgg 
gtacacacag 



gcttccccca cccttgtgga gtcacttcat 60 
gccagttttc tgattagcct ttcctgttgt 12 0 

130 



<210> 49 
<211> 1401 
<212> DNA 
<213> HUMAN 



<400> 49 

caaactttca cttttgttgc ccaggctgga gtgcaatggc gcgatctcgg ctcactgcaa 60 
cctccacctc ccgggttcaa gtgattctcc tgcctcagcc tctagccaag tagctgcgat 12 0 
tacaggcatg cgccaccacg cccggctaat ttttgtattt ttagtagaga cggggtttcg 180 
ccatgttggt caggctggtc tcgaactcct gatctcaggt gatccaacca ccctggcctc 240 
ccaaagtgct gggattatag gcgtgagcca cagcgcctgg cctgaagcag ccactcactt 3 00 
ttacagaccc taagacaatg attgcaagct ggtaggattg ctgtttggcc cacccagctg 3 60 
cggtgttgag tttgggtgcg gtctcctgtg ctttgcacct ggcccgctta aggcatttgt 42 0 
tacccgtaat gctcctgtaa ggcatctgcg tttgtgacat cgttttggtc gccaggaagg 480 
gattggggct ctaagcttga gcggttcatc cttttcattt atacagggga tgaccagagt 54 0 
cattggcgct atggaggtga gacacccacc cgctgcacag acccaatctg ggaacccagc 600 
tctgtggatc tcccctacag ccgtccctga acactggtcc cgggcgtccc acccgccgcc 660 
caccgtccca ccccctcacc ttttctaccc gggttcccta agttcctgac ctaggcgtca 720 
gacttcctca ctatactctc ccaccccagg cgacccgccc tggccccggg tgtccccagc 780 
ctgcgcgggc cgcttccagt ccccggtgga tatccgcccc cagctcgccg ccttctgccc 840 
ggccctgcgc cccctggaac tcctgggctt ccagctcccg ccgctcccag aactgcgcct 900 
gcgcaacaat ggccacagtg gtgagggggt ctccccgccg agacttgggg atggggcggg 960 
gcgcagggaa gggaaccgtc gcgcagtgcc tgcccggggg ttgggctggc cctaccgggc 102 0 
ggggccggct cacttgcctc tccctacgca gtgcaactga ccctgcctcc tgggctagag 1080 
atggctctgg gtcccgggcg ggagtaccgg gctctgcagc tgcatctgca ctggggggct 1140 
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gcaggtcgtc cgggctcgga gcacactgtg 
gcggactggc cgagaagggg caaaggagcg 
tcctaccctc gtgtcctttt cagatccacg 
ttgacgaggc cttggggcgc ccgggaggcc 
cagatcctgg acacccccta c 



gaaggccacc gtttccctgc cgaggtgagc 12 00 
gggcggacgg gggccagaga cgtggccctc 1260 
tggttcacct cagcaccgcc tttgccagag 132 0 
tggccgtgtt ggccgccttt ctggaggtac 1380 

1401 



<210> 50 
<211> 59 
<212> PRT 
<213> HUMAN 

<400> 50 
Ser Ser Gly Glu 
1 

Glu Asp Ser Pro 
20 

Gly Glu Glu Asp 
35 

Lys Ser Glu Glu 
50 



Asp Asp Pro Leu 
5 

Arg Glu Glu Asp 

Leu Pro Gly Glu 
40 

Glu Gly Ser Leu 
55 



Gly Glu Glu Asp 
10 

Pro Pro Gly Glu 
25 

Glu Asp Leu Pro 



Lys Leu Glu 



Leu Pro Ser Glu 
15 

Glu Asp Leu Pro 
30 

Glu Val Lys Pro 
45 



<210> 51 
<211> 257 
<212> PRT 
<213> HUMAN 

<400> 51 

Gly Asp Asp Gin Ser His Trp Arg Tyr Gly Gly Asp Pro Pro Trp Pro 
15 10 15 

Arg Val Ser Pro Ala Cys Ala Gly Arg Phe Gin Ser Pro Val Asp lie 
20 25 30 

Arg Pro Gin Leu Ala Ala Phe Cys Pro Ala Leu Arg Pro Leu Glu Leu 
35 40 45 

Leu Gly Phe Gin Leu Pro Pro Leu Pro Glu Leu Arg Leu Arg Asn Asn 
50 55 60 

Gly His Ser Val Gin Leu Thr Leu Pro Pro Gly Leu Glu Met Ala Leu 
65 70 75 80 
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Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gin Leu His Leu His Trp Gly 

85 90 95 



Ala Ala Gly Arg 
100 

Pro Ala Glu lie 
115 

Asp Glu Ala Leu 
130 

Leu Glu Glu Gly 
145 

Arg Leu Glu Glu 



Leu Asp lie Ser 
180 

Tyr Glu Gly Ser 
195 

Thr Val Phe Asn 
210 

Leu Ser Asp Thr 
225 

Phe Arg Ala Thr 



Pro Gly Ser Glu 



His Val Val His 
120 

Gly Arg Pro Gly 
135 

Pro Glu Glu Asn 
150 

He Ala Glu Glu 
165 

Ala Leu Leu Pro 



Leu Thr Thr Pro 
200 

Gin Thr Val Met 
215 

Leu Trp Gly Pro 
230 

Gin Pro Leu Asn 
245 



His Thr Val Glu 
105 

Leu Ser Thr Ala 



Gly Leu Ala Val 
14 0 

Ser Ala Tyr Glu 
155 

Gly Ser Glu Thr 
170 

Ser Asp Phe Ser 
185 

Pro Cys Ala Gin 



Leu Ser Ala Lys 
220 

Gly Asp Ser Arg 
235 

Gly Arg Val He 
250 



Gly His Arg Phe 
110 

Phe Ala Arg Val 
125 

Leu Ala Ala Phe 



Gin Leu Leu Ser 
160 

Gin Val Pro Gly 
175 

Arg Tyr Phe Gin 
190 

Gly Val He Trp 
205 

Gin Leu His Thr 



Leu Gin Leu Asn 
240 

Glu Ala Ser Phe 
255 



Pro 



<210> 52 
<211> 20 
<212> PRT 
<213> HUMAN 

<400> 52 

He Leu Ala Leu Val Phe Gly Leu Leu Phe Ala Val Thr Ser Val Ala 
15 10 15 
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Phe Leu Val Gin 
20 



<210> 53 
<211> 25 
<212> PRT 
<213> HUMAN 

<400> 53 

Met Arg Arg Gin His Arg Arg Gly 
1 5 

Pro Ala Glu Val Ala Glu Thr Gly 
20 



Thr Lys Gly Gly Val Ser Tyr Arg 
10 15 

Ala 
25 



<210> 54 
<211> 59 
<212> PRT 
<213> HUMAN 

<400> 54 
Ser Ala Ser Glu 
1 

Glu Pro Ser Pro 
20 

Ser Val Val Leu 
35 

Ser Pro Ser Glu 
50 



Glu Pro Ser Pro 

5 

Ser Glu Glu Pro 



Phe Pro Ser Glu 
40 

Glu Pro Ser Ala 
55 



Ser Glu Val Pro 
10 

Phe Pro Ser Val 
25 

Glu Pro Phe Pro 



Ser Glu Glu 



Phe Pro Ser Glu 
15 

Arg Pro Phe Pro 
30 

Ser Lys Glu Pro 
45 



<210> 55 
<211> 470 
<212> RNA 
<213> HUMAN 

<400> 55 

cauggccccg auaaccuucu gccugugcac 
gcuuugguau gggggagagg gcacagggcc 
cugcaaaagg gcgcucugug agucagccug 
agcucucguu uccaaugcac guacagcccg 



acaccugccc cucacuccac ccccauccua 60 
agacaaaccu gugagacuuu ggcuccaucu 12 0 
cuccccucca ggcuugcucc ucccccaccc 180 
uacacaccgu gugcugggac accccacagu 24 0 



29 



cagccgcaug gcuccccugu gccccagccc cuggcucccu cuguugaucc cggccccugc 300 
uccaggccuc acugugcaac ugcugcuguc acugcugcuu cuggugccug uccaucccca 360 
gagguugccc cggaugcagg aggauucccc cuugggagga ggcucuucug gggaagauga 420 
cccacugggc gaggaggauc ugcccaguga agaggauuca cccagagagg 470 



<210> 56 
<211> 292 
<212> DNA 
<213> HUMAN 

<400> 56 

gtttttttga gacggagtct tgcatctgtc atgcccaggc tggagtagca gtggtgccat 60 
ctcggctcac tgcaagctcc acctcccgag ttcacgccat tttcctgcct cagcctcccg 120 
agtagctggg actacaggcg cccgccacca tgcccggcta attttttgta tttttggtag 180 
agacggggtt tcaccgtgtt agccagaatg gtctcgatct cctgacttcg tgatccaccc 240 
gcctcggcct cccaaagttc tgggattaca ggtgtgagcc accgcacctg gc 2 92 



<210> 57 
<211> 262 
<212> DNA 
<213> HUMAN 

<400> 57 

tttctttttt gagacagggt cttgctctgt cacccaggcc agagtgcaat ggtacagtct 60 
cagctcactg cagcctcaac cgcctcggct caaaccatca tcccatttca gcctcctgag 120 
tagctgggac tacaggcaca tgccattaca cctggctaat ttttttgtat ttctagtaga 180 
gacagggttt ggccatgttg cccgggctgg tctcgaactc ctggactcaa gcaatccacc 240 
cacctcagcc tcccaaaatg ag 262 



<210> 58 

<211> 2501 

<212> DNA 

<213> HUMAN 

<220> 

<221> misc_f eature 
<222> (1) . . (2501) 

<223> region 5' to transcription initiation site as determined by 
RNase protection assay (nucleotide 3507 of Figures 2A-2F and 
of SEQ ID NO: 5), corresponding to region of SEQ ID NO: 5 
and Figures 2A-2F from nucleotide (7) to nucleotide (2 507) , 
in which region some regulatory elements are probably situated. 
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<220> 

<221> unsure what base is at position 1968 
<222> (1968) 

<223> unsure of base at position 1968 , which is the same unknown base 
as that at position 1974 of SEQ ID MO. 5, i.e., the full-length 
MN genomic sequence, and of that unknown at position 1968 of SEQ 
ID NO: 90, and unknown at position 647 of SEQ ID NO: 110. That 
unknown base is in the 5' region flanking the transcription 
initiation site (3507) as determined by RNase protection assay. 

<400> 58 

tgttgactcg tgaccttacc cccaaccctg tgctctctga aacatgagct gtgtccactc 60 
agggttaaat . ggattaaggg cggtgcaaga tgtgctttgt taaacagatg cttgaaggca 12 0 
gcatgctcgt taagagtcat caccaatccc taatctcaag taatcaggga cacaaacact 180 
gcggaaggcc gcagggtcct ctgcctagga aaaccagaga cctttgttca cttgtttatc 240 
tgaccttccc tccactattg tccatgaccc tgccaaatcc ccctctgtga gaaacaccca 300 
agaattatca ataaaaaaat aaatttaaaa aaaaaataca aaaaaaaaaa aaaaaaaaaa 3 60 
aaaagactta cgaatagtta ttgataaatg aatagctatt ggtaaagcca agtaaatgat 42 0 
catattcaaa accagacggc catcatcaca gctcaagtct acctgatttg atctctttat 480 
cattgtcatt ctttggattc actagattag tcatcatcct caaaattctc ccccaagttc 540 
taattacgtt ccaaacattt aggggttaca tgaagcttga acctactacc ttctttgctt 600 
ttgagccatg agttgtagga atgatgagtt tacaccttac atgctgggga ttaatttaaa 660 
ctttacctct aagtcagttg ggtagccttt ggcttatttt tgtagctaat tttgtagtta 720 
atggatgcac tgtgaatctt gctatgatag ttttcctcca cactttgcca ctaggggtag 780 
gtaggtactc agttttcagt aattgcttac ctaagaccct aagccctatt tctcttgtac 840 
tggcctttat ctgtaatatg ggcatattta atacaatata atttttggag tttttttgtt 900 
tgtttgtttg tttgtttttt tgagacggag tcttgcatct gtcatgccca ggctggagta 960 
gcagtggtgc catctcggct cactgcaagc tccacctccc gagttcacgc cattttcctg 1020 
cctcagcctc ccgagtagct gggactacag gcgcccgcca ccatgcccgg ctaatttttt 108 0 
gtatttttgg tagagacggg gtttcaccgt gttagccaga atggtctcga tctcctgact 114 0 
tcgtgatcca cccgcctcgg cctcccaaag ttctgggatt acaggtgtga gccaccgcac 120 0 
ctggccaatt ttttgagtct tttaaagtaa aaatatgtct tgtaagctgg taactatggt 1260 
acatttcctt ttattaatgt ggtgctgacg gtcatatagg ttcttttgag tttggcatgc 132 0 
atatgctact ttttgcagtc ctttcattac atttttctct cttcatttga agagcatgtt 1380 
atatctttta gcttcacttg gcttaaaagg ttctctcatt agcctaacac agtgtcattg 1440 
ttggtaccac ttggatcata agtggaaaaa cagtcaagaa attgcacagt aatacttgtt 1500 
tgtaagaggg atgattcagg tgaatctgac actaagaaac tcccctacct gaggtctgag 1560 
attcctctga cattgctgta tataggcttt tcctttgaca gcctgtgact gcggactatt 162 0 
tttcttaagc aagatatgct aaagttttgt gagccttttt ccagagagag gtctcatatc 1680 
tgcatcaagt gagaacatat aatgtctgca tgtttccata tttcaggaat gtttgcttgt 1740 
gttttatgct tttatataga cagggaaact tgttcctcag tgacccaaaa gaggtgggaa 180 0 
ttgttattgg atatcatcat tggcccacgc tttctgacct tggaaacaat taagggttca 1860 
taatctcaat tctgtcagaa ttggtacaag aaatagctgc tatgtttctt gacattccac 1920 
ttggtaggaa ataagaatgt gaaactcttc agttggtgtg tgtccctngt ttttttgcaa 1980 
tttccttctt actgtgttaa aaaaaagtat gatcttgctc tgagaggtga ggcattctta 2 040 
atcatgatct ttaaagatca ataatataat cctttcaagg attatgtctt tattataata 2100 
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aagataattt gtctttaaca gaatcaataa tataatccct taaaggatta tatctttgct 2160 
gggcgcagtg gctcacacct gtaatcccag cactttgggt ggccaaggtg gaaggatcaa 2220 
atttgcctac ttctatatta tcttctaaag cagaattcat ctctcttccc tcaatatgat 2280 
gatattgaca gggtttgccc tcactcacta gattgtgagc tcctgctcag ggcaggtagc 2 340 
gttttttgtt tttgtttttg tttttctttt ttgagacagg gtcttgctct gtcacccagg 2400 
ccagagtgca atggtacagt ctcagctcac tgcagcctca accgcctcgg ctcaaaccat 2460 
catcccattt cagcctcctg agtagctggg actacaggca c 2501 



<210> 59 
<211> 292 
<212> DNA 
<213> HUMAN 



<220> 

<221> misc_f eature 
<222> (1) 



<400> 59 

tttttttgag acggagtctt gcatctgtca tgcccaggct ggagtagcag tggtgccatc 60 
tcggctcact gcaagctcca cctcccgagt tcacgccatt ttcctgcctc agcctcccga 120 
gtagctggga ctacaggcgc ccgccaccat gcccggctaa ttttttgtat ttttggtaga 180 
gacggggttt caccgtgtta gccagaatgg tctcgatctc ctgacttcgt gatccacccg 240 
cctcggcctc ccaaagttct gggattacag gtgtgagcca ccgcacctgg cc 2 92 



<210> 60 
<211> 262 
<212> DNA 
<213> HUMAN 



<400> 60 

ttcttttttg agacagggtc ttgctctgtc 
agctcactgc agcctcaacc gcctcggctc 
agctgggact acaggcacat gccattacac 
acagggtttg gccatgttgc ccgggctggt 
acctcagcct cccaaaatga gg 



acccaggcca gagtgcaatg gtacagtctc 60 
aaaccatcat cccatttcag cctcctgagt 12 0 
ctggctaatt tttttgtatt tctagtagag 180 
ctcgaactcc tggactcaag caatccaccc 240 

262 



<210> 61 
<211> 294 
<212> DNA 
<213> HUMAN 



<400> 61 

tttttttttg agacaaactt tcacttttgt tgcccaggct ggagtgcaat ggcgcgatct 60 
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cggctcactg caacctccac ctcccgggtt caagtgattc tcctgcctca gcctctagcc 120 

aagtagctgc gattacaggc atgcgccacc acgcccggct aatttttgta tttttagtag 180 

agacggggtt tcgccatgtt ggtcaggctg gtctcgaact cctgatctca ggtgatccaa 240 

ccaccctggc ctcccaaagt gctgggatta taggcgtgag ccacagcgcc tggc 2 94 



<210> 62 
<211> 276 
<212> DNA 
<213> HUMAN 



<400> 62 

tgacagtctc tctgtcgccc aggctggagt gcagtggtgt ^gatcttgggt cactgcaact 60 
tccgcctccc gggttcaagg gattctcctg cctcagcttc ctgagtagct ggggttacag 12 0 
gtgtgtgcca ccatgcccag ctaatttttt tttgtatttt tagtagacag ggtttcacca 180 
tgttggtcag gctggtctca aactcctggc ctcaagtgat ccgcctgact cagcctacca 240 
aagtgctgat tacaagtgtg agccaccgtg cccagc 276 



<210> 63 
<211> 289 
<212> DNA 
<213> HUMAN 



<400> 63 

cgccgggcac ggtggctcac gcctgtaatc ccagcacttt gggaggccaa ggcaggtgga 60 

tcacgaggtc aagagatcaa gaccatcctg gccaacatgg tgaaacccca tctctactaa 12 0 

aaatacgaaa aaatagccag gcgtggtggc gggtgcctgt aatcccagct actcgggagg 180 

ctgaggcagg agaatggcat gaacccggga ggcagaagtt gcagtgagcc gagatcgtgc 240 

cactgcactc cagcctgggc aacagagcga gactcttgtc tcaaaaaaa 289 



<210> 64 
<211> 298 
<212> DNA 
<213> HUMAN 



<400> 64 

aggctgggct ctgtggctta cgcctataat 
atggtttgag cccaggagtt caagacaagg 
caaaaaaacc ccaacaaaac caaaaatagc 
ctactcaagg aggctgaggt gggaagatcg 
gctatgatcc caccactgcc taccatcttt 



cccaccacgt tgggaggctg aggtgggaga 60 
cggggcaaca tagtgtgacc ccatctctac 12 0 
cgggcatggt ggtatgcggc ctagtcccag 180 
cttgattcca ggagtttgag actgcagtga 240 
aggatacatt tatttattta taaaagaa 2 98 



<210> 65 
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<211> 105 
<212> DNA 
<213> HUMAN 

<400> 65 

ttttttacat ctttagtaga gacagggttt caccatattg gccaggctgc tctcaaactc 60 
ctgaccttgt gatccaccag cctcggcctc ccaaagtgct gggat 105 



<210> 66 
<211> 83 
<212> DNA 
<213> HUMAN 

<400> 66 

cctcgaactc ctaggctcag gcaatccttt caccttagct tctcaaagca ctgggactgt 60 
aggcatgagc cactgtgcct ggc 83 



<210> 67 
<2ll> 11 
<212> DNA 
<213> HUMAN 

<400> 67 

agaaggtaag t 11 



<210> 68 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 68 

tggaggtgag a 11 



<210> 69 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 69 

cagtcgtgag g 11 
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<210> 70 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 70 
ccgaggtgag c 

<210> 71 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 71 
tggaggtacc a 

<210> 72 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 72 
ggaaggtcag t 

<210> 73 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 73 
agcaggtggg c 

<210> 74 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 74 

gccaggtaca g 



<210> 75 



<211> 11 
<212> DNA 
<213> HUMAN 

<400> 75 

tgctggtgag t 



<210> 76 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 76 
atacagggga t 

<210> 77 

<211> 11 

<212> DNA 

<213> HUMAN 

<400> 77 
atacagggga t 

<210> 78 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 78 
ccccaggcga c 

<210> 79 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 79 
acgcagtgca a 

<210> 80 
<211> 11 



<212> DNA 
<213> HUMAN 

<400> 80 
tttcagatcc a 



<210> 81 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 81 
ccccaggagg g 

<210> 82 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 82 
tcacaggctc a 

<210> 83 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 83 
ccctagctcc a 

<210> 84 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 84 
ctccagtcca g 



<210> 85 
<211> 12 
<212> DNA 



<213> HUMAN 



<400> 85 
tcgcaggtga ca 



<210> 86 
<211> 11 
<212> DNA 
<213> HUMAN 

<400> 86 
acacagaagg g 



<210> 87 
<211> 377 
<212> PRT 
<213> HUMAN 

<400> 87 
Gin Arg Leu Pro 
1 

Ser Gly Glu Asp 
20 

Asp Ser Pro Arg 
35 

Glu Glu Asp Leu 
50 

Ser Glu Glu Glu 
65 

Ala Pro Gly Asp 



Glu Gly Asp Asp 
100 

Pro Arg Val Ser 
115 



Arg Met Gin Glu 
5 

Asp Pro Leu Gly 



Glu Glu Asp Pro 
40 

Pro Gly Glu Glu 
55 

Gly Ser Leu Lys 
70 

Pro Gin Glu Pro 
85 

Gin Ser His Trp 



Pro Ala Cys Ala 
120 



Asp Ser Pro Leu 
10 

Glu Glu Asp Leu 
25 

Pro Gly Glu Glu 



Asp Leu Pro Glu 
60 

Leu Glu Asp Leu 
75 

Gin Asn Asn Ala 
90 

Arg Tyr Gly Gly 
105 

Gly Arg Phe Gin 



Gly Gly Gly Ser 
15 

Pro Ser Glu Glu 
30 

Asp Leu Pro Gly 
45 

Val Lys Pro Lys 



Pro Thr Val Glu 
80 

His Arg Asp Lys 
95 

Asp Pro Pro Trp 
110 

Ser Pro Val Asp 
125 
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lie Arg Pro Gin Leu Ala Ala Phe Cys Pro Ala Leu Arg Pro Leu Glu 
130 135 140 



Leu Leu Gly Phe Gin Leu Pro Pro Leu Pro Glu Leu Arg Leu Arg Asn 
145 150 155 160 

Asn Gly His Ser Val Gin Leu Thr Leu Pro Pro Gly Leu Glu Met Ala 

165 170 175 

Leu Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gin Leu His Leu His Trp 
180 185 190 

Gly Ala Ala Gly Arg Pro Gly Ser Glu His Thr Val Glu Gly His Arg 
195 200 205 

Phe Pro Ala Glu He His Val Val His Leu Ser Thr Ala Phe Ala Arg 
210 215 220 

Val Asp Glu Ala Leu Gly Arg Pro Gly Gly Leu Ala Val Leu Ala Ala 
225 230 235 240 

Phe Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala Tyr Glu Gin Leu Leu 

245 250 255 

Ser Arg Leu Glu Glu He Ala Glu Glu Gly Ser Glu Thr Gin Val Pro 
260 265 270 

Gly Leu Asp He Ser Ala Leu Leu Pro Ser Asp Phe Ser Arg Tyr Phe 
275 280 285 

Gin Tyr Glu Gly Ser Leu Thr Thr Pro Pro Cys Ala Gin Gly Val He 
290 295 300 

Trp Thr Val Phe Asn Gin Thr Val Met Leu Ser Ala Lys Gin Leu His 
305 310 315 320 

Thr Leu Ser Asp Thr Leu Trp Gly Pro Gly Asp Ser Arg Leu Gin Leu 

325 330 335 

Asn Phe Arg Ala Thr Gin Pro Leu Asn Gly Arg Val He Glu Ala Ser 
340 345 350 

Phe Pro Ala Gly Val Asp Ser Ser Pro Arg Ala Ala Glu Pro Val Gin 
355 360 365 
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Leu Asn Ser Cys Leu Ala Ala Gly Asp 
370 375 



<210> 88 
<211> 34 
<212> DNA 
<213> HUMAN 

<400> 88 

tagacagatc tacgatggct cccctgtgcc ccag 34 



<210> 89 
<211> 34 
<212> DNA 
<213> HUMAN 

<400> 89 

attcctctag acagttaccg gctccccctc agat 34 



<210> 90 
<211> 3532 
<212> DNA 
<213> HUMAN 

<220> 

<221> mis cofeature which includes the MN gene promoter 
<222> (1) . . (3532) 

<223> region including the transcription initiation site (nucleotide 
3507 of SEQ ID NO: 5 and of Figures 2A-2F) as determined by 
RNase protection assay, which region is inclusive of the MN 
gene promoter, and corresponds to nucleotide 7 to nucleotide 
3538 of SEQ ID NO: 5 and of Figures 2A-2F. 

<220> 

<221> unsure what base is at position 1968 
<222> (1968) 

<223> unsure of the base at position 1968, which is the same unknown 
base at position 1974 of SEQ ID NO: 5 (the full-length MN 
genomic sequence) , position 1968 of SEQ ID NO: 58 and position 
647 of SEQ ID NO: 110. That unknown base is in the region that 
includes the transcription initiation site {nucleotide 3507 of 
SEQ ID NO: 5 and of Figures 2A-2F) as determined by RNase 
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protection assay, which region is inclusive of the MN gene 
promoter. 



<400> 90 

tgttgactcg tgaccttacc cccaaccctg tgctctctga aacatgagct gtgtccactc 60 
agggttaaat ggattaaggg cggtgcaaga tgtgctttgt taaacagatg cttgaaggca 12 0 
gcatgctcgt taagagtcat caccaatccc taatctcaag taatcaggga cacaaacact 180 
gcggaaggcc gcagggtcct ctgcctagga aaaccagaga cctttgttca cttgtttatc 24 0 
tgaccttccc tccactattg tccatgaccc tgccaaatcc ccctctgtga gaaacaccca 300 
agaattatca ataaaaaaat aaatttaaaa aaaaaataca aaaaaaaaaa aaaaaaaaaa 3 60 
aaaagactta cgaatagtta ttgataaatg aatagctatt ggtaaagcca agtaaatgat 42 0 
catattcaaa accagacggc catcatcaca gctcaagtct acctgatttg atctctttat 480 
cattgtcatt ctttggattc actagattag tcatcatcct caaaattctc ccccaagttc 540 
taattacgtt ccaaacattt aggggttaca tgaagcttga acctactacc ttctttgctt 600 
ttgagccatg agttgtagga atgatgagtt tacaccttac atgctgggga ttaatttaaa 660 
ctttacctct aagtcagttg ggtagccttt ggcttatttt tgtagctaat tttgtagtta 720 
atggatgcac tgtgaatctt gctatgatag ttttcctcca cactttgcca ctaggggtag 780 
gtaggtactc agttttcagt aattgcttac ctaagaccct aagccctatt tctcttgtac 840 
tggcctttat ctgtaatatg ggcatattta atacaatata atttttggag tttttttgtt 900 
tgtttgtttg tttgtttttt tgagacggag tcttgcatct gtcatgccca ggctggagta 960 
gcagtggtgc catctcggct cactgcaagc tccacctccc gagttcacgc cattttcctg 102 0 
cctcagcctc ccgagtagct gggactacag gcgcccgcca ccatgcccgg ctaatttttt 1080 
gtatttttgg tagagacggg gtttcaccgt gttagccaga atggtctcga tctcctgact 114 0 
tcgtgatcca cccgcctcgg cctcccaaag ttctgggatt acaggtgtga gccaccgcac 1200 
ctggccaatt ttttgagtct tttaaagtaa aaatatgtct tgtaagctgg taactatggt 1260 
acatttcctt ttattaatgt ggtgctgacg gtcatatagg ttcttttgag tttggcatgc 1320 
atatgctact ttttgcagtc ctttcattac atttttctct cttcatttga agagcatgtt 1380 
atatctttta gcttcacttg gcttaaaagg ttctctcatt agcctaacac agtgtcattg 1440 
ttggtaccac ttggatcata agtggaaaaa cagtcaagaa attgcacagt aatacttgtt 1500 
tgtaagaggg atgattcagg tgaatctgac actaagaaac tcccctacct gaggtctgag 1560 
attcctctga cattgctgta tataggcttt tcctttgaca gcctgtgact gcggactatt 1620 
tttcttaagc aagatatgct aaagttttgt gagccttttt ccagagagag gtctcatatc 1680 
tgcatcaagt gagaacatat aatgtctgca tgtttccata tttcaggaat gtttgcttgt 1740 
gttttatgct tttatataga cagggaaact tgttcctcag tgacccaaaa gaggtgggaa 1800 
ttgttattgg atatcatcat tggcccacgc tttctgacct tggaaacaat taagggttca 1860 
taatctcaat tctgtcagaa ttggtacaag aaatagctgc tatgtttctt gacattccac 192 0 
ttggtaggaa ataagaatgt gaaactcttc agttggtgtg tgtccctngt ttttttgcaa 1980 
tttccttctt actgtgttaa aaaaaagtat gatcttgctc tgagaggtga ggcattctta 2 040 
atcatgatct ttaaagatca ataatataat cctttcaagg attatgtctt tattataata 2100 
aagataattt gtctttaaca gaatcaataa tataatccct taaaggatta tatctttgct 2160 
gggcgcagtg gctcacacct gtaatcccag cactttgggt ggccaaggtg gaaggatcaa 222 0 
atttgcctac ttctatatta tcttctaaag cagaattcat ctctcttccc tcaatatgat 2280 
gatattgaca gggtttgccc tcactcacta gattgtgagc tcctgctcag ggcaggtagc 234 0 
gttttttgtt tttgtttttg tttttctttt ttgagacagg gtcttgctct gtcacccagg 2400 
ccagagtgca atggtacagt ctcagctcac tgcagcctca accgcctcgg ctcaaaccat 2460 
catcccattt cagcctcctg agtagctggg actacaggca catgccatta cacctggcta 252 0 
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atttttttgt atttctagta gagacagggt ttggccatgt tgcccgggct ggtctcgaac 2580 
tcctggactc aagcaatcca cccacctcag cctcccaaaa tgagggaccg tgtcttattc 2640 
atttccatgt ccctagtcca tagcccagtg ctggacctat ggtagtacta aataaatatt 2700 
tgttgaatgc aatagtaaat agcatttcag ggagcaagaa ctagattaac aaaggtggta 2760 
aaaggtttgg agaaaaaaat aatagtttaa tttggctaga gtatgaggga gagtagtagg 2820 
agacaagatg gaaaggtctc ttgggcaagg ttttgaagga agttggaagt cagaagtaca 2880 
caatgtgcat atcgtggcag gcagtgggga gccaatgaag gcttttgagc aggagagtaa 2 940 
tgtgttgaaa aataaatata ggttaaacct atcagagccc ctctgacaca tacacttgct 3000 
tttcattcaa gctcaagttt gtctcccaca tacccattac ttaactcacc ctcgggctcc 3060 
cctagcagcc tgccctacct ctttacctgc ttcctggtgg agtcagggat gtatacatga 3120 
gctgctttcc ctctcagcca gaggacatgg ggggccccag ctcccctgcc tttccccttc 3180 
tgtgcctgga gctgggaagc aggccagggt tagctgaggc tggctggcaa gcagctgggt 3240 
ggtgccaggg agagcctgca tagtgccagg tggtgccttg ggttccaagc tagtccatgg 33 00 
ccccgataac cttctgcctg tgcacacacc tgcccctcac tccaccccca tcctagcttt 3360 
ggtatggggg agagggcaca gggccagaca aacctgtgag actttggctc catctctgca 3420 
aaagggcgct ctgtgagtca gcctgctccc ctccaggctt gctcctcccc cacccagctc 3480 
tcgtttccaa tgcacgtaca gcccgtacac accgtgtgct gggacacccc ac 3532 



<210> 91 
<211> 204 
<212> DNA 
<213> HUMAN 

<400> 91 

cctgcccctc actccacccc catcctagct 

caaacctgtg agactttggc tccatctctg 

ccctccaggc ttgctcctcc cccacccagc 

acaccgtgtg ctgggacacc ccac 



ttggtatggg ggagagggca cagggccaga 60 
caaaagggcg ctctgtgagt cagcctgctc 120 
tctcgtttcc aatgcacgta cagcccgtac 180 

204 



<210> 92 
<211> 132 
<212> DNA 
<213> HUMAN 



<400> 92 

ggatcctgtt gactcgtgac cttaccccca 
ccactcaggg ttaaatggat taagggcggt 
aaggcagcat gc 



accctgtgct ctctgaaaca tgagctgtgt 60 
gcaagatgtg ctttgttaaa cagatgcttg 12 0 

132 



<210> 93 
<211> 275 
<212> DNA 
<213> HUMAN 
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<400> 93 

gcatagtgcc aggtggtgcc ttgggttcca 
ctgtgcacac acctgcccct cactccaccc 
acagggccag acaaacctgt gagactttgg 
tcagcctgct cccctccagg cttgctcctc 
acagcccgta cacaccgtgt gctgggacac 



agctagtcca tggccccgat aaccttctgc 60 
ccatcctagc tttggtatgg gggagagggc 12 0 
ctccatctct gcaaaagggc gctctgtgag 180 
ccccacccag ctctcgtttc caatgcacgt 240 
cccac 275 



<210> 94 
<211> 89 
<212> DNA 
<213> HUMAN 

<400> 94 

ctgctcccct ccaggcttgc tcctccccca cccagctctc gtttccaatg cacgtacagc 60 
ccgtacacac cgtgtgctgg gacacccca 89 



<210> 95 
<211> 61 
<212> DNA 
<213> HUMAN 

<400> 95 

cacccagctc tcgtttccaa tgcacgtaca gcccgtacac accgtgtgct gggacacccc 60 
a 61 



<210> 96 
<211> 116 
<212> DNA 
<213> HUMAN 

<400> 96 

acctgcccct cactccaccc ccatcctagc tttggtatgg gggagagggc acagggccag 60 
acaaacctgt gagactttgg ctccatctct gcaaaagggc gctctgtgag tcagcc 116 



<210> 97 
<211> 36 
<212> PRT 
<213> HUMAN 

<400> 97 

Gly Glu Glu Asp Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp 
15 10 15 
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Pro Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu 
20 25 30 



Glu Asp Leu 
35 



<210> 98 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 98 
Gly Glu Glu 
1 



<210> 99 
<211> 4 
<212> PRT 
<213> HUMAN 

<400> 99 
Glu GlU Asp 
1 



<210> 100 
<211> 5 
<212> PRT 
<213> HUMAN 

<400> 100 
Glu Glu Asp 
1 



<210> 101 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 101 
Glu Asp Leu 
1 



Pro 



Asp Leu Pro 
5 



Leu 



Leu Pro 
5 



Pro Ser Glu 
5 
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<210> 102 
<211> 7 
<212> PRT 
<213> HUMAN 

<400> 102 

Glu Glu Asp Leu Pro Ser Glu 
1 5 



<210> 103 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 103 

Asp Leu Pro Gly Glu Glu 
1 5 



<210> 104 
<211> 22 
<212> PRT 
<213> HUMAN 

<400> 104 

Gly Gly Ser Ser Gly Glu Asp Asp Pro Leu Gly Glu Glu Asp Leu Pro 
15 10 15 

Ser Glu Glu Asp Ser Pro 
20 



<210> 105 
<211> 25 
<212> PRT 
<213> HUMAN 

<400> 105 

Gly Glu Glu Asp Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp 
15 10 15 

Pro Pro Gly Glu Glu Asp Leu Pro Gly 
20 25 
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<210> 106 
<211> 24 
<212> PRT 
<213> HUMAN 

<400> 106 

Glu Asp Pro Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro 
15 10 15 

Gly Glu Glu Asp Leu Pro Glu Val 
20 



<210> 107 
<211> 7 
<212> PRT 
<213> HUMAN 

<400> 107 

Gly Glu Thr Arg Ala Pro Leu 
1 5 



<210> 108 
<211> 7 
<212> PRT 
<213> HUMAN 

<400> 108 

Gly Glu Thr Arg Glu Pro Leu 
1 5 



<210> 109 
<211> 7 
<212> PRT 
<213> HUMAN 

<400> 109 

Gly Gin Thr Arg Ser Pro Leu 
1 5 



<210> 110 
<211> 1247 
<212> DNA 
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<213> HUMAN 



<220> 

<221> misc_f eature 
<222> (1) . . (1247) 

<223> region 5* to the transcription initiation site as determined 
by RNase protection assay (nucleotide 3507 of SEQ ID NO: 5 
and of Figures 2A-2F) in which an activating element is 
localized, which region corresponds to nucleotide 1328 to 
nucleotide 2574 of SEQ ID NO: 5 and of Figures 2A-2F. 

<220> 

<221> unsure what base is at position 647 
<222> (647) 

<223> unsure of the base at position 647, which is the same unknown 
base as that at position 1974 of SEQ ID NO: 5, and as that at 
position 1968 of SEQ ID NOS : 58 and 90. That unknown base at 
position 647 is in a region in which an activating element is 
localized and is 5' to the transcription initiation site. 

<400> 110 

tatgctactt tttgcagtcc tttcattaca tttttctctc ttcatttgaa gagcatgtta 60 

tatcttttag cttcacttgg cttaaaaggt tctctcatta gcctaacaca gtgtcattgt 120 

tggtaccact tggatcataa gtggaaaaac agtcaagaaa ttgcacagta atacttgttt 180 

gtaagaggga tgattcaggt gaatctgaca ctaagaaact cccctacctg aggtctgaga 240 

ttcctctgac attgctgtat ataggctttt cctttgacag cctgtgactg cggactattt 300 

ttcttaagca agatatgcta aagttttgtg agcctttttc cagagagagg tctcatatct 360 

gcatcaagtg agaacatata atgtctgcat gtttccatat ttcaggaatg tttgcttgtg 420 

ttttatgctt ttatatagac agggaaactt gttcctcagt gacccaaaag aggtgggaat 480 

tgttattgga tatcatcatt ggcccacgct ttctgacctt ggaaacaatt aagggttcat 540 

aatctcaatt ctgtcagaat tggtacaaga aatagctgct atgtttcttg acattccact 600 

tggtaggaaa taagaatgtg aaactcttca gttggtgtgt gtccctngtt tttttgcaat 660 

ttccttctta ctgtgttaaa aaaaagtatg atcttgctct gagaggtgag gcattcttaa 720 

tcatgatctt taaagatcaa taatataatc ctttcaagga ttatgtcttt attataataa 780 

agataatttg tctttaacag aatcaataat ataatccctt aaaggattat atctttgctg 840 

ggcgcagtgg ctcacacctg taatcccagc actttgggtg gccaaggtgg aaggatcaaa 900 

tttgcctact tctatattat cttctaaagc agaattcatc tctcttccct caatatgatg 960 

atattgacag ggtttgccct cactcactag attgtgagct cctgctcagg gcaggtagcg 102 0 

ttttttgttt ttgtttttgt ttttcttttt tgagacaggg tcttgctctg tcacccaggc 1080 

cagagtgcaa tggtacagtc tcagctcact gcagcctcaa ccgcctcggc tcaaaccatc 114 0 

atcccatttc agcctcctga gtagctggga ctacaggcac atgccattac acctggctaa 12 00 

tttttttgta tttctagtag agacagggtt tggccatgtt gcccggg 1247 



<210> 111 
<211> 17 
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<212> DNA 
<213> HUMAN 

<400> 111 

ctctgtgagt cagcctg 



<210> 112 
<211> 23 
<212> DNA 
<213> HUMAN 

<400> 112 

aggcttgctc ctcccccacc cag 



<210> 113 
<211> 18 
<212> DNA 
<213> HUMAN 

<400> 113 

agactttggc tccatctc 



<210> 114 

<211> 20 

<212> DNA 

<213> HUMAN 

<400> 114 

cactccaccc ccatcctagc 



<210> 115 
<211> 26 
<212> DNA 
<213> HUMAN 

<400> 115 

999 a 9 a 999 c acagggccag acaaac 



<210> 116 
<211> 20 
<212> PRT 



* 



<213> HUMAN 
<400> 116 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
15 10 15 

Gly Gly Gly Ser 
20 



<210> 117 
<211> 34 
<212> DNA 
<213> HUMAN 

<400> 117 

cgtctagaag gaattcagct agactggctc agca 34 



<210> 118 

<211> 15 

<212> PRT 

<213> HUMAN 



<400> 118 

Glu Val Lys Pro Lys Ser Glu Glu Glu Gly Ser Leu Lys Leu Glu 
15 10 15 



<210> 119 
<211> 12 
<212> PRT 
<213> HUMAN 

<400> 119 

Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro 
15 10 



<210> 120 

<211> 12 

<212> PRT 

<213> HUMAN 

<400> 120 
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Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly 
15 10 



<210> 121 
<211> 10 
<212> PRT 
<213> HUMAN 



<400> 121 

Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro 
15 10 



<210> 122 
<211> 12 
<212> PRT 
<213> HUMAN 

<400> 122 

Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu 
15 10 



<210> 123 
<211> 12 
<212> PRT 
<213> HUMAN 

<400> 123 

Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp 
15 10 



<210> 124 
<211> 12 
<212> PRT 
<213> HUMAN 

<400> 124 

Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu 
15 10 



<210> 125 
<211> 9 
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<212> PRT 
<213> HUMAN 



<400> 125 

Ala Pro Gly Glu Glu Asp Leu Pro Ala 
1 5 



<210> 126 
<211> 9 
<212> PRT 
<213> HUMAN 

<400> 126 

Ala Gly Glu Glu Asp Leu Pro Gly Ala 
1 5 



<210> 127 
<211> 9 
<212> PRT 
<213> HUMAN 

<400> 127 

Ala Glu Glu Asp Leu Pro Gly Glu Ala 
1 5 



<210> 128 
<211> 9 
<212> PRT 
<213> HUMAN 

<400> 128 

Ala Glu Asp Leu Pro Gly Glu Glu Ala 
1 5 



<210> 129 
<211> 9 
<212> PRT 
<213> HUMAN 



<400> 129 

Ala Asp Leu Pro Gly Glu Glu Asp Ala 
1 5 



<210> 130 
<211> 9 
<212> PRT 
<213> HUMAN 

<400> 130 

Ala Leu Pro Gly Glu Glu Asp Leu Ala 
1 5 



<210> 131 
<211> 8 
<212> PRT 
<213> HUMAN 

<400> 131 

Ala Gly Glu Glu Asp Leu Pro Ala 
1 5 



<210> 132 
<211> 8 
<212> PRT 
<213> HUMAN 

<400> 132 

Ala Glu Glu Asp Leu Pro Gly Ala 
1 5 



<210> 133 
<211> 8 
<212> PRT 
<213> HUMAN 

<400> 133 

Ala Glu Asp Leu Pro Gly Glu Ala 
1 5 



<210> 134 
<211> 8 
<212> PRT 
<213> HUMAN 



<400> 134 

Ala Asp Leu Pro Gly Glu Glu Ala 
1 5 



<210> 135 
<211> 8 
<212> PRT 
<213> HUMAN 

<400> 135 

Ala Leu Pro Gly Glu Glu Asp Ala 
1 5 



<210> 136 
<211> 8 
<212> PRT 
<213> HUMAN 

<400> 136 

Ala Pro Gly Glu Glu Asp Leu Ala 

1 5 



<210> 137 
<211> 9 
<212> PRT 
<213> HUMAN 

<400> 137 

Ala Lys Lys Met Lys Arg Arg Lys Ala 
1 5 



<210> 138 
<211> 9 
<212> PRT 
<213> HUMAN 

<400> 138 

Ala lie Thr Phe Asn Ala Gin Tyr Ala 
1 5 



<210> 139 



<211> 9 
<212> PRT 
<213> HUMAN 

<400> 139 

Ala Ser Ala Ser Ala Pro Val Ser Ala 
1 5 



<210> 140 
<211> 9 
<212> PRT 
<213> HUMAN 

<400> 140 

Ala Gly Gin Thr Arg Ser Pro Leu Ala 
1 5 



<210> 141 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 141 

Ser Glu Glu Asp Ser Pro 
1 5 



<210> 142 
<211> 6 
<212> PRT 
<213> HUMAN 

<400> 142 

Arg Glu Glu Asp Pro Pro 
1 5 



<210> 143 
<211> 12 
<212> DNA 
<213> HUMAN 



<400> 143 
agggcacagg gc 



